Total least squares based subband modelling for scalable speech representations with damped sinusoids
نویسندگان
چکیده
We describe how Total Least Squares (TLS) algorithms can be applied as a powerful and eÆcient modelling tool for wideband speech. A detailed description in both time domain and frequency domain illustrates how the modelling functions { damped sinusoids { naturally synthesise non-stationary signals. Straightforward implementations of TLS applied to fullband speech are known to be computationally hard and they can suffer from numerical sensitivity. In this paper we introduce a subband approach, which leads to a signi cant reduction of the computational load with an enhanced numerical stability. Moreover, it enables to control the distribution of the TLS components over the spectral range of the input signal such that perceptual criteria can be incorporated in the modelling scheme. We also address the scalability of our design from smallband speech to high quality audio, and provide evidence for the existence of coupled components in TLS modelled segments.
منابع مشابه
Perceptual audio modeling with exponentially damped sinusoids
This paper presents the derivation of a new perceptual model that represents speech and audio signals by a sum of exponentially damped sinusoids. Compared to a traditional sinusoidal model, the exponential sinusoidal model (ESM) is better suited to model transient segments that are readily found in audio signals. Total least squares (TLS) algorithms are applied for the automatic extraction of t...
متن کاملFrequency and Damping Estimation Methods – an Overview
This overview paper presents and compares different methods traditionally used for estimating damped sinusoid parameters. Firstly, direct nonlinear least squares fitting the signal model in the time and frequency domains are described. Next, possible applications of the Hilbert transform for signal demodulation are presented. Then, a wide range of autoregressive modelling methods, valid for dam...
متن کاملSpeech synthesis using damped sinusoids.
A speech synthesizer was developed that operates by summing exponentially damped sinusoids at frequencies and amplitudes corresponding to peaks derived from the spectrum envelope of the speech signal. The spectrum analysis begins with the calculation of a smoothed Fourier spectrum. A masking threshold is then computed for each frame as the running average of spectral amplitudes over an 800-Hz w...
متن کاملACOUSTICS2008/2066 Damped sinusoids and subspace based approach for lossy audio coding
The new subspace-based techniques recently introduced appear to be well adapted for the parameters estimation of a damped sinusoids + noise signal model. These High-Resolution (HR) methods have a better frequency resolution than the Fourier analysis, but they are rarely used in audio coding. Although HR methods would be suitable for parametric coding at low bitrates, we show that they are also ...
متن کاملDevelopment of high quality acoustic subband echo canceller using dual-filter structure and fast recursive least squares algorithm
A high quality acoustic subband echo canceller is developed based on a dual-filter structure and the fast recursive least squares (FRLS) algorithm. Methods for overcoming the instability problem of the FRLS algorithm and implementing it using the 32-bit fixed-point arithmetic are presented. A new tap-weight transfer method, which assures double talk detection, is proposed. Computer simulations ...
متن کامل